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Abstract 


4' 

The  problem  addressed  in  this  paper  is  heunsticaily-guided  learning  of  finite  automata 
from  examples.  Given  positive  sample  strings  and  negative  sample  strings,  a  finite 
automaton  is  generated  and  incrementally  refined  to  accept  all  positive  samples  but  no 
negative  samples.  This  paper  describes  some  experiments  in  applying  hill-climbing  to 
modify  finite  automata  to  accept  a  desired  regular  language.  We  show  that  many  problems 
can  be  solved  by  this  simple  method.  We  then  describe  the  method  hew  to  "re- construct' 
a  finite  automaton  if  the  positive  and/or  negative  samples  are  slightly  altered,  without 
starting  from  the  beginning.  Finally,  we  have  an  actual  system.  RR:  Regular  set 
Recognizer,  that  learns  to  recognize  a  regular  set  from  the  samples  that  are  given  by  a 
human  teacher  one  by  one. 
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INTRODUCTION 


1 .  Introduction 


Consider  the  following  problem: 

Describe  the  property  that  all  strings  in  the  right-list  have  but  no  string  in  the  wrong-list  has.  Does  a 
string  (110  1)  have  this  property?  You  may  answer  the  question  by  using  any  of  the  following: 
English,  a  regular  expression,  or  a  finite  automaton.1 


right-list 

0 

(1) 

(0) 

(01) 

(1  D 
(00) 

(10O 

(110) 

(HI) 

(000) 

(100100) 

(11000001  1  1  00001) 
(11110110001001110O 


wrong-list 

(10) 

(101) 

(010) 

(1010) 

(1110) 

(1011) 

(10001) 

(1 11010) 

(1  001000) 

(1  1  111  000) 
(0111001101) 
(11011100110) 


K  might  be  possible  to  construct  the  finite  automaton  by  a  "typical"  schema-filling  method  (i.e., 
finding  rough  property  in  the  samples  first,  comparing  these  strings  carefully).  However,  in  this  paper, 
we  try  to  construct  the  finite  automaton  directly  by  searching  in  the  problem  space  (i.e.,  the  set  of  all 
finite  automata)  using  hill-climbing,  rather  than  by  analyzing  the  samples  carefully.  One  of  the  biggest 
advantages  of  hill-climbing  is  its  simplicity,  that  is,  we  do  not  have  to  know  our  problem  space  well, 
while  a  "typical”  schema-filling  method  requires  us  to  provide  alt  possible  schemas,  and  therefore  to 
know  everything  about  our  problem  space.  We  shall  see  that  hffl-cflmbing  works  much  better  than 
expected  in  our  problem  space,  and  in  fact  solved  most  of  the  problems. 


1 .1  The  finite  automata  used  in  this  paper 

We  restrict  our  problem  domain  to  be  only  over  (1,0)*.  Furthermore,  since  every  non -deterministic 
finite  automaton  has  an  equivalent  deterministic  finite  automaton  (see  [Hopcroft  79]),  we  deal  only 
with  deterministic  finite  automata,  that  is,  there  is  at  most  one  1-arrow  and  one  0-arrow  from  each 
state.  Thus,  in  this  paper,  the  terms  "finite  automaton”,  "automaton”  or  "machine”  all  mean 
"deterministic  finite  automaton".  Given  a  string  s,  if  there  is  a  transition  from  the  initial  state  to  any  of 
the  final  states,  then  s  is  accepted  by  the  machine,  otherwise  s  is  rejected.  For  example,  the  machine 
of  the  sample  problem  is  shown  in  figure  1-1. 


1  * 

The  newer  it  airings  over  (14-0)  without  an  odd  number  of  consecutive  0 a  AFTER  an  odd  number  of  consecutive  Vs. 
Therefore  (t  1 01)  has  the  property- 


a 
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Figure  1-1 :  The  machine  of  the  sample  problem 


Each  machine  with  n  states  is  denoted  by  the  following  form: 
((AvB1,F1)(A2,B2,F2)....(An,Bn,Fn)). 

Each  (A.,  Bj,  F.)  corresponds  to  the  state  i,  and  A,  and  B,  indicate  the  destination  states  of  the  O-arrow 
and  the  1 -arrow  from  the  state  i,  respectively.  If  A,  or  B,  is  zero,  then  there  is  no  O-arrow  or  1- arrow 
from  the  state  i,  respectively.  F,  indicates  whether  state  i  is  one  of  the  final  states  or  not  If  F,  is  equal 
to  1,  the  state  i  is  one  of  the  final  states.  The  initial  state  is  always  state  1.  For  instance,  figure  1-1  is 
represented  as  follows: 

((121)(311)(400)(341)). 


1.2  The  problem 

We  now  are  ready  to  describe  the  problem  predaeiy.  Given  a  right-list  (a  set  of  positive  sample 
strings)  and  a  wrong-list  (a  set  of  negative  sample  strings),  we  can  think  of  the  following  three  tasks: 

1 .  To  find  a  machine  that  accepts  all  strings  in  the  right-list  but  none  in  the  wrong-list 

2.  To  find  a  machine  with  n  states  that  accepts  all  strings  in  the  right-list  but  none  in  the 
wrong-list 

3.  To  find  the  machine  with  fewest  states  (simplest  machine)  that  accepts  all  strings  in  the 
right-list  but  none  in  the  wrong-list 

The  first  task  is  trivial  because  one  can  easily  construct  a  trivial  machine  that  accepts  exactly  all 
strings  in  the  right-list  but  nothing  else.2  The  second  task  and  the  third  task  are  shown  to  be  NP 
complete  problems  by  [Gold  74].  We  call  the  second  task  construction  of  finite  automata,  and  the 
third  task  simplification  of  finite  automata. 


1 .3  Past  Work 

Feldman,  Gips,  Homing  and  Reder  [Feldman  67]  [Feldman  69]  built  a  system  that  constructs  a 
grammar  in  BNF  from  given  examples.  It  takes  only  positive  examples,  and  its  problem  domain  is 
context-free  languages.  We  quote  a  couple  of  sample  runs  of  this  system  from  [Feldman  68] ,  to  make 
dear  how  their  system  worked. 


2  An  example  of  the  trivial  macNna  vrf*  be  found  in  section  3-4-2. 
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Figure  1  -2:  Sample  Strings  and  BNF  grammar  produced  by  Feldman’s  system 


(b) 

(a  a  b) 

(a  ■  a  ■  b) 

(a  a  a  a  a  »  b) 

S  <-  b  |  S,nS 
St  <-  a 


(c  d) 

<*  b  d) 

(a  c  b  d) 

(a  a  b  b  d) 

(a  a  c  b  b  d) 
(aaabbbd) 

(a  a  a  e  b  b  b  d) 


S  <-  S.d 
Si  <*  «»  I  e 
sj  <-  |  b 

— iw— ew  mct 

(•  ••) 

(1  a  •  a  r  a  a) 

(a  a  1  a  a  a  r) 

(1  a  a  a  r  a  1  aaar) 
(T  a  a  1  a  a  a  r  r  a  a) 
(1  1  aaaraaraa) 
(aal  a  a  1  aaarr) 
(a  a  1  ~1  a  a  a  r  a  a  r) 

S  <-  S.aS. 

Jt  <-  I  • 

S,  <-  Sr 


Their  system  first  constructs  a  "trivial"  grammar,  and  then  simplifies  it  As  we  can  see,  their  system 
requires  us  to  provide  nicely-chosen  examples,  and  it  cannot  solve  from  poorly  organized  examples 
such  as  the  problem  we  introduced  at  the  beginning. 

Bierman  and  Feldman  then  buitt  a  system  that  constructs  a  finite  automaton  from  given  examples. 
Although  it  takes  only  positive  examples,  they  showed  an  application  to  the  case  where  both  positive 
and  negative  examples  are  given.  Their  algorithm  also  requires  nicely-chosen  examples,  and  they 
showed  the  method  to  choose  the  examples  from  a  regular  set  ’’nicety",  so  that  it  always  turns  out  the 
simplest  machine.  However  if  the  examples  are  not  nicely-chosen,  as  in  the  problem  we  introduced  at 
the  beginning,  their  system  hardly  turns  out  the  simplest  machine. 

Apart  from  the  grammatical  inference,  there  has  been  a  good  deal  of  work  on  discovery  of  a 
regularity  or  a  common  pattern  in  the  given  examples  that  are  not  necessarily  nicely-chosen 
( [Langley  81a]  [Langley  81b]  [Buchanan  76]  [Hayes- roth  77]  [Michaiaki  73]  [Vere  75]  [Winston  70]). 
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1 .4  Overview  of  the  Paper 

In  the  rest  of  this  chapter,  we  present  the  7  sample  problems,  that  we  will  consider  throughout  this 
paper  (7  sample  problems  and  their  inverses). 

In  chapter  2,  we  present  the  result  of  an  experiment  jn  constructing  finite  automata  with  n  states 
using  hill-climbing,  in  particular,  we  let  n  ■  8.  We  shall  see  that  all  14  sample  problems  can  be  solved 
by  this  method. 

In  chapter  3,  we  present  the  result  of  an  experiment  in  simplifying  the  finite  automata  which  we 
have  found  in  chapter  2,  also  using  hill-climbin').  We  shall  see  that  we  can  find  the  simplest  machine 
for  most  of  the  problems  by  this  method. 

In  chapter  4,  we  discuss  re-construction  of  finite  automata,  that  is,  how  to  re-construct  a  finite 
automaton  if  the  right-list  and  the  wrong-list  are  slightly  altered.  We  might  not  want  to  construct  it 
from  the  beginning,  father,  we  want  to  construct  the  new  machine  by  modifying  the  previous 
machine. 

Finally,  we  have  an  actual  system  called  Regular  set  Recognizer  [RR],  using  the  techniques  above. 
RR  teams  to  recognize  a  regular  set,  given  examples  by  a  human  "teacher".  We  present  several 
sample  runs  as  well  as  a  user’s  manual,  in  chapters 


1 .5  Sample  Problems 


1 .5.1  Sample  Problems 

Throughout  this  paper,  we  consider  the  following  7  sample  problems. 

ProbfmU 

right-list 
0 
(D 
(ID 
(111) 

(1111) 

(11111) 

(111111) 

(1111 1.1 1) 

(11111111) 


wrong- Hat 
(0) 

(10) 

(01) 

(00) 

(011) 

(110) 

(11111110) 

(10111111) 
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Problem  2 


right- list 

wrong-list 

0 

(D 

(10) 

(0) 

(1010) 

(1  1) 

(101010) 

(00) 

(1  0 1  0 1 0 1  0) 

(01) 

(10101010101010) 

(101) 

(10  0) 

(1001010) 

(10110) 

(110101010) 

Problems3 

right-list 

wrong- list 

0 

(10) 

(D 

(101) 

(0) 

(010) 

(01) 

(1010) 

(1  D 

(1110) 

(00) 

(ton) 

(100) 

(10001) 

(ilO) 

(111010) 

(111) 

(1001  00Q) 

(000) 

(1  1111000) 

(100100) 

(0111001101) 

(110000011100001) 

(1T110T100010011100) 

(11011  1001  10) 

nenhUan  A 
rTOCMwH  4 

wrong-list 
(000) 

(1 1  000) 

(0001) 

(000000000) 

(11111  000011) 

(1  1010100000101  1  1) 
(1010010001) 

(0000) 

(00000) 

(11100) 

(010) 


right-list 

0 

(1) 

(0) 

(10) 

(01) 

(00) 

(100100) 

(001 1  1 1  1101  00) 
(0100100100) 


aTW*  proUam  w  introduced  at  tho  vory  beginning. 
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Problem  5 


right-list 

wrong-list 

0 

(1) 

(1  D 

(0) 

(00) 

(111) 

(1001) 

(010) 

(0101) 

(000000000) 

(1010) 

(1  000) 

(10  001  1  1101) 

(01) 

(10  01  1000011  1  1010) 

(10) 

(111111) 

(1  1  10010100) 

(00  00) 

ProblMifl 

(010111111110) 

(0001) 

(Oil) 

right-list 

wrong- list 

0 

(D 

(10) 

(0) 

(01) 

(1 1) 

(1100) 

(00) 

(101010) 

(101) 

(111) 

(Oil) 

(000000) 

(11001) 

(10111) 

(1111) 

(0111101111) 

(00000000) 

(100100100) 

(0101 11) 
(10111101111) 

(1  001001001) 

ProfahmU 

right-list 

wrong- list 

0 

(1010) 

(D 

(001 1  001 1  000) 

(0) 

(0101010101) 

(10) 

(101101® 

(01) 

(1  0101) 

(11111) 

(010100) 

(000) 

(1  01001) 

(00110011) 

(100100110101) 

(01  01) 

(0000100001  1  1  1) 
(00100) 

(011111011111) 

(00) 
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1 .5.2  Solution  of  Sample  Problems 
The  solution  of  these  problems  are: 

1.1* 

2.(10)* 

3.  any  string  without  an  odd  number  of  consecutive  0’s  AFTER  an  odd  number  of 
consecutive  1’s. 

4.  any  string  without  more  than  2  consecutive  0's. 

5.  any  string  of  even  length  which,  making  pairs,  has  an  odd  number  of  (0 1)  or  (1  0)’s. 

6.  any  string  such  that  the  difference  between  the  numbers  of  1's  and  0’s  is  3n. 

7.  oYoV. 

1 .5.3  Finite  Automata  of  Solutions 

The  machines  corresponding  to  these  solutions  are  as  follows. 

Solution  of  Problem  1 


INTRODUCTION 


1.5.4  Inverse  Problems 

We  also  consider  the  inverse  problems  of  these  sample  problems.  The  inverse  problems  are 
created  by  exchanging  the  right-list  and  wrong-list.  We  use  these  1 4  problems  in  our  experiments 
and  refer  to  the  inverse  problem  of  problem  1  as  problem  1-,  the  inverse  problem  of  problem  2  as 
problem  2-,  and  so  on. 
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CONSTRUCTION  OF  FINITE  AUTOMATA 

2.  Construction  of  Finite  Automata 

In  this  chapter,  we  describe  an  experiment  in  constructing  a  finite  automaton  with  n  states  from  a 
given  right-list  and  a  wrong-list,  using  hill-climbing.  In  particular,  we  let  n  equal  8.  We  shall  see  that 
each  of  the  14  problems  can  be  solved  in  at  most  a  few  thousands  steps. 


2.1  Algorithm 

The  hill-climbing  algorithm  of  this  experiment  is  shown  in  figure  2-1 . 

Figure  2-1 :  Flowchart  of  the  Hill-Climbing 


We  first  construct  a  random  machine  with  8  states.  We  next  make  a  copy  of  this  machine,  where  the 
copy  is  slightly  altered  from  the  original  by  an  operator  mutate.  We  compare  the  new  machine  with 
the  original  by  an  evaluation  function  E.  The  better  machine  is  called  currant  generation  and  we 
make  a  copy  of  this  machine,  and  so  forth.  The  worse  machine  is  simply  discarded.  The  operator 
mutate  and  the  evaluation  function  E  are  defined  more  precisely  in  the  following. 

Operator  mutate:  Taking  a  machine  ((Av  B1f  F^  . . .  (Aa,  Ba,  Fa))  as  its  argument  the  operator 
mutate  chooses  one  digit  randomly,  and  replaces  it  by  another  digit4  That  is,  the  mutation  in  our 
algorithm  is  randomly  one  of  tile  following:  delete  an  arrow,  insert  an  arrow,  change  the  destination 


40  «<  A.  .<8:0  *<  B.  ><8;  andO  «<F.  .<1. 


12 


CONSTRUCTION  OF  FINITE  AUTOMATA 


of  an  arrow  to  another  destination,  make  a  non-final  state  into  a  final  state,  and  make  a  final  state  into 
a  non-final  state. 

Evaluation  Function  E:  The  evaluation  function  E  takes  a  machine  as  its  argument  and  returns  r 
-  w,  where  r  is  the  number  of  strings  in  the  right-list  accepted  by  the  machine,  and  w  is  the  number  of 
strings  in  die  wrong-list  accepted  by  the  machine.  If  r  -  *  <  0  then  it  returns  0. 


2.2  Results 

We  show  in  this  chapter  the  result  of  our  experiments.  We  first  show  in  figure  2-2  the  trace  of  the 
experiment  of  problem  3,  to  see  how  bur  algorithm  gradually  refines  a  random  machine  into  the 
desired  machine.  Each  line  corresponds  to  the  current  generation  M.  The  column  E  indicates  E(M), 
and  6  indicates  the  cumulative  number  of  generation.  The  final  machine  of  this  trace  accepts  all 
strings  in  the  right-list  but  none  in  the  wrong-list  of  problem  3  (figure  2-3). 

We  show  the  results  for  the  other  13  problems  in  figure  2-4. 

Figure  2-2:  Sample  Trace  of  Problem  3 


CONSTRUCTION  OR  FINITE  AUTOMATA 


Figure  2*3:  The  final  machine  of  problem  3 


figure  2-4:  The  results  of  the  other  13  problems 
Final  Machine  of  Problem  1 
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1 
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Final  Machine  nf  Problem^ 
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((7  6  0)(4  2  0)(4  Z  1)(4  3  1)(6  Z  0)(Z  Q  0)(7  8  0)(6  0  1))  3728 
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2.3  Discussion 


2.3.1  Hill-Climbing  vs.  Exhaustive  Search 

To  see  how  effectively  our  hill-climbing  algorithm  has  performed,  we  compare  our  method  with  an 
exhaustive  search.  There  are  (9x9x2)®  ■  about  5  x  1017  machines  in  our  problem  space.  We  now 
./ant  to  know  the  number  of  the  desired  machines  in  our  problem  space,  so  that  we  can  calculate  the 
expected  number  of  steps  until  the  exhaustive  algorithm  finds  the  first  desired  machine.  This  can  be 
done  by  the  following  "sampling"  method:  take  one  machine  in  the  problem  space  randomly,  and  test 
if  this  machine  is  the  desired  machine;  repeat  this  procedure  100,000  times. 

We  show  the  expected  number  of  steps  using  the  exhaustive  search  calculated  by  this  procedure  in 
figure  2-5.  Although  the  exhaustive  search  works  better  on  "easy"  problems,  it  is  obvious  in  general 
that  our  hill-climbing  works  much  better  than  the  exhaustive  search. 

Figure  2-5:  The  number  of  Steps  to  get  the  desired  machine 


Pro bleu 

HI  ft -C11eb1ng 

Exhaustl  ve-Sear  clt 

pi 

98 

33 

92 

134 

316 

PS 

2062 

>  80000 

94 

442 

12800 

95 

1768 

>  80000 

95 

277 

60000 

97 

206 

60000 

PI- 

300 

167 

P2- 

80- 

1862 

P9- 

1930 

>  80000 

P4- 

246 

>  80000 

pa- 

1644 

>  80000 

pe- 

886 

>  80000 

P7- 

3728 

>  60000 

2.3.2  Result  with  Different  Numbers  of  States 

So  far,  we  fixed  the  number  of  states  to  be  &  In  this  section,  we  shall  try  the  same  experiment  with 
different  numbers  of  states  (4  - 10).  Figure  2-8  shows  the  result  of  this  experiment  In  the  table,  " 
indicates  "it  could  not  strive  within  the  given  time".  This  can  happen  when  the  hill-climbing  algorithm 
climbs  a  "local  hill".  This  table  implies  that  the  number  of  states  n  should  be  reasonably  large  to 
avoid  climbing  a  local  hill,  and  we  can  hardly  get  the  simplest  machine  by  this  method.  We  shall, 
however,  see  that  we  can  simplify  the  machine  with  8  states  that  we  have  gotten  in  this  chapter,  so 
that  it  becomes  the  simplest  machine. 
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Figure  2-6:  The  Number  of  States  and  Runtime  [sec.] 


NUMBER  OF  STATES 


PROBLEM 

4 

6 

6 

7 

8 

9 

10 

1 

0.4 

0.4 

1.9 

6.3 

2.2 

1.9 

0.6 

2 

0.6 

0.4 

6.4 

2.2 

3.0 

4.1 

3.6 

3 
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— 

3.3 

'39.6 

129.0 

16.2 

168.3 

4 

12.3 

4.8 

18.1 

9.4 

13.1 

1.4 

11.7 

5 

— 

— 

184.7 

7.9 

66.6 

220.4 

'  96.6 

6 

3.1 

12.5 

6.2 

20.6 

7.9 

26.3 

137.8 

T 

49.6 

10.9 

23.2 

2.8 

7.1 

6.6 

18.6 

1- 

2.4 

0.8 

1.3 

4.3 

8.1 

1.8 

2.6 

2- 

16.3 

1.9 

13.9 

7.4 

2.4 

19.2 

17.9 

3- 

... 

— 

— 

76.6 

78.4 

— 

243.6 

4- 

— 

23.9 

12.6 

20.7 

7.7 

14.8 

17.6 

6- 

18Z.0 

— 

— 

28.8 

66.5 

62.1 

68.4 

8* 

2.8 

3.5 

63.4 

13.6 

29.0 

6.6 

7.6 

7- 

... 

— 

283.6 

- — 

138.7 

64.6 

33.3 

SIMPLIFICATION  OF  FINITE  AUTOMATA 

3.  Simplification  of  Finite  Automata 

In  the  previous  chapter,  we  saw  that  our  hill-climbing  method  successfully  produced  a  machine 
that  accepts  all  strings  in  the  right-list  but  no  string  in  the  wrong-list.  However,  the  final  machine  of 
the  result  of  problem  2,  for  example,  does  not  accept  our  desired  regular  set  (1  0)’.  For  instance,  it 
does  accept  a  string  (110  0),  which  is  not  in  (1  0)*.  We  therefore  want  the  machine  to  be 
"generalized"  so  that  it  accepts  exactly  (1  0)*.  In  fact,  the  final  machines  of  all  problems  except 
problem  1,  3  and  7,  need  to  be  generalized. 

We  define  the  generality  of  a  machine  in  terms  of  its  simplicity.  The  simplicity  of  a  machine  is 
determined  by  the  number  of  states  the  machine  has,  and  if  two  machines  have  the  same  number  of 
states,  a  machine  with  fewer  arrows  and  final  states  is  simpler. 

Our  task  is  to  simplify  the  machines  we  have  obtained  in  the  previous  chapter,  so  that  the  machines 
become  the  simplest  or  the  most  general .  We  call  this  task  simplification  of  finite  automata,  and  it 
can  be  also  done  by  using  a  hill-climbing  method. 


3.1  Minimization 


Before  we  simplify  the  final  machine  of  the  previous  experiment,  we  first  remove  any  useless  arrows 
and  states,  using  a  Minimization  Algorithm  (see,  for  example,  [Hopcroft  79]).  We  show  the  result  of 
the  minimization  in  figure  3-1.  Note  that  even  after  minimization,  all  problems  except  1,  3  and  7  still 
need  to  be  generalized. 

Figure  3-1:  Minimized  Final  Machine 


Problem  Min lei zed  Machine 


n  ((0  1  1)) 

P2  ((0  4  1)(3  0  0)(8  0  1)(1  8  Q)(2  2  0)(B  6  1)) 

P3  ((1  2  1)(3  l  1)(4  0  Q)(3  4  1)) 

P4  ((4  1  1)(3  3  l)(l  2  0)(5  1  1)(0  3  1)) 

P5  ((6  3  1) (3  6  0)(2  1  0 ) ( 5  4  0)(0  1  0)(1  4  0)) 

P8  ((8  3  1)(8  3  1)(1  8  Q)(2  6  0)(4  2  0)(4  1  0)) 

P7  ((1  2  1)(3  2  1)(3  4  1)(0  4  1)) 

PI-  ((3  2  Q)(8  1  0 ) ( 3  5  1)<3  4  1)(Q  4  1)) 

P2-  ((8  8  0)(2  3  1)(2  0  0)(2  6  1)(4  2  1)(1  4  1)) 

P3-  ((3  8  Q)(4  6  1)(1  4  0)(2  4  0)(5  7  1)(2  3  0)(Q  8  1)) 

P4-  ((4  7  Q)(2  2  1)(2  8  0)(6  1  0)(2  0  Q)(7  4  1)(1  3  0)) 

P8-  ((4  3  0) (8  8  0)(8  2  1)(1  6  1)(3  1  1)(5  4  1)) 

P8-  ((2  4  0)(4  1  1 ) ( 3  2  0)(t  5  1)(3  1  1)) 

P7-  ((7  8  0)(4  2  0)(4  2  1)(4  3  1)(5  2  0)(2  0  0)(7  8  0)) 


3.2  Simplification  Algorithm 

The  algorithm  for  simplification  is  similar  to  the  algorithm  described  in  the  previous  chapter.  The 
major  differences  are  as  follows:  (1)  the  evaluation  function  E(M)  returns  a  higher  value  if  the 
machine  M  is  simpler  (2)  if  M  does  not  accept  some  strings  in  the  right-list,  or  does  accept  some 
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strings  in  the  wrong-list,  E(M)  returns  minus  infinity;  (3)  the  algorithm  starts  with  the  minimized  final 
machine  of  the  previous  experiment  instead  of  a  random  machine;  (4)  whenever  a  "useless  state"  (i.e. 
a  non-final  state  with  neither  0-arrow  nor  1  -arrow)  is  found,  delete  it. 


3.3  Results 

A  sample  trace  of  problem  2-  is  shown  in  figure  3-2.  Each  line  corresponds  to  current  generation 
M,  and  the  right-most  number  is  the  cumulative  number  of  steps.  The  final  machine  of  this  trace  is  the 
desired  simplest  machine. 

The  final  machines  of  all  14  problems  are  shown  in  figure  3-3.  We  see  that  some  problems  could 
not  be  simplified  completely  within  the  given  time,  probabiy  because  the  search  was  climbing  a  local 
hill. 


3.4  Discussion 


3.4.1  Hill-Climbing  vs.  Exhaustive  Search 

We  compare  our  method  with  an  exhaustive  search.  The  exhaustive  search  enumerates  all 
machines  in  the  order  of  simplicity,  and  the  first  machine  that  accepts  ail  strings  in  the  right-list  but 
none  in  the  wrong-list  is  considered  the  simplest  machine.  Thus  we  can  calculate  the  expected 
number  of  steps  until  the  exhaustive  search  finds  the  desired  machine5 .  The  result  is  shown  in  figure 
3-4® 


*Let  n  be  the  number  of  states  at  the  simplest  machine.  Then  the  expected  number  of  steps  Sn  * 
where  U  is  the  number  of  aH  possible  machines  with  f  states,  that  is, 

Uj.0*1)a,,*2n 

®The  number  of  steps  using  hHI-cMmbing  in  this  figure  is  the  sum  of  the  number  of  steps  to  construct  the  8  state  machine  and 
the  number  of  steps  to  simplify  it  into  the  simplest  machine. 
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Figure  3-2:  Sample  Trace  of  Problem  2- 


[P2]  ((0  2  1)(1  0  0))  7 

[P4]  ((2  1  1)(3  1  1)(0  1  1))  88 

[P6]  ((4  3  1} (3  4  0 ) ( 2  1  0)(1  2  0))  42 

[P8]  ((3  2  1)(1  3  0) ( 2  1  0))  174 

[P1-]  ((2  1  0)(2  2  1))  146 

[P2-]  ((2  3  0)(2  2  1)(1  2  1))  871 

[P3-]  ((1  5  0 ) ( 3  4  1 ) ( 2  3  0)(2  4  1)(2  1  0))  383 

[P4-]  ((3  6  0)(2  2  1)(4  1  0)(2  0  0)(1  1  0))  <HOT-SIHPLEST> 

[P6-]  ((4  3  0)(6  6  0 ) ( 6  2  1)(1  B  1)(3  1  1)(6  4  1))  <80T-SIMPLEST> 

[P8-]  ((2  3  0 ) ( 3  1  1)(1  2  1))  44 

[P7-]  ((1  6  Q)(4  6  0 ) ( 4  2  1)(4  3  1)(5  2  0)(4  0  0))  <80T-SIHPLEST> 


Tie."  OF 


TA 


Figure  3-4:  The  Number  of  Stops  to  obtain  the  simplest  machine 


Problem  Hill-Climbing  Exhaustive-Search 


PI 

96 

4 

P2 

141 

170 

P3 

2062 

553933 

P4 

610 

8624 

P5 

1810 

553933 

P0 

461 

8524 

P7 

206 

653933 

PI- 

446 

170 

P2- 

1060 

8624 

P3- 

2302 

46693884 

P4- 

— 

653933 

P5- 

— 

663933 

P0- 

930 

8624 

P7- 

46593884 

3.4.2  Simplification  from  Trivial  Machine 

We  have  seen  that  our  hill-climbing  works  rather  successfully,  although  some  problems  could  not 
be  simplified  completely.  Our  method  consists  of  2  parts,  the  construction  process  (chapter  2)  and 
the  simplification  process  (chapter  3).  That  is,  we  first  construct  a  machine  with  8  states  and  then 
simplify  it.  One  might  suppose  that  we  could  get  the  simplest  machine  using  only  the  construction 
process,  by  choosing  the  number  of  states  sufficiently  small.  Unfortunately,  in  the  previous  chapter, 
we  showed  that  the  number  of  states  should  be  reasonably  large,  and  we  cannot  do  that.  One  might 
also  notice  that  we  would  not  need  any  construction  process,  because  we  can  easily  construct  a 
trivial  machine,  which  accepts  exactly  all  strings  in  the  right-list  but  nothing  else.  Figure  3-6  is  an 
example  of  the  trivial  machine.  In  this  section,  we  describe  some  experiments  to  try  to  simplify  from 
the  trivial  machine.  We  shall  see  that  to  simplify  from  the  trivial  machine  is  much  less  effective  than 
our  construction-simplification  method.  The  result  of  the  experiments  is  shown  in  figure  3-7.  When 
we  compare  figure  3-3  and  figure  3-7,  it  is  obvious  that  our  construction-simplification  method  is  more 
effective  than  the  second  method. 


26 


•  u  i 


r  a 


Figure  3-6:  Trivial  Machine  of  Problem  5 


Figure  3*7:  Result  of  Simplification  from  Trivial  Machine 


[PI]  ((0  1  1))  2  107 
[P2]  ((0  2  1)(1  0  0))  3  132 

[P3]  ((1  2  1) (6  4  1)(3  6  0)(7  8  1)(6  0  0)(0  2  1)(3  0  1) (0  9  1)(10  0  0) 

(0  11  0)(0  12  0 ) ( 13  0  0)(14  0  0 ) ( 2  0  0))  24  <H0T-SIMPLEST> 

[P4]  ((3  2  1)(1  6  1 ) ( 4  2  1) ( 0  5  1) (0  2  0))  12  <NOT-SIMPLEST> 

[P6]  ((3  2  1) (4  1  0)(1  4  0)(2  3  0))  9  1879 

[P6]  ((3  2  1) ( 1  3  0)(2  1  0))  7  1801 

[P7]  ((3  2  1) ( 4  8  l)(t,  6  1)(0  0  1)(1  2  1) (3  3  0))  16  <NOT-SIMPLEST> 

CPI- J  ((2  1  0) ( 2  2  1))  5  448 

[P2-]  ((3  2  0)(1  3  1 ) ( 3  3  1))  8  1249 

[P3-]  ((1  2  0)(3  1  0)(6  4  1)(4  3  1) ( 3  6  0))  12  <NOT-SIHPLEST> 

[P4-]  ((3  1  0) (2  2  1)(4  1  0 ) ( 2  1  0))  9  3692 

[P5-]  ((3  2  0) ( 2  4  1)(1  8  1)(0  6  0)(8  0  1) ( 7  0  1) ( 0  10  1)(9  0  0)(0  3  0) 
(0  11  0) ( 0  12  0)(0  13  0)(0  14  0) (0  9  0))  22  <N0T-SIMPLEST> 

[P6-]  ((3  2  0) (6  3  1)(4  1  1)(1  0  1)(1  2  0))  12  <MOT-SIMPLEST> 

[P7-]  ((3  7  0) (2  2  1)(4  7  0)(0  6  0)(0  6  0) ( 7  0  0) (8  0  0)(2  8  0))  13 

<KOT-SIMPLEST> 


27 


T£  CO'.'jTPL'CTIC?!  OF  F'i-ITE  AUTOMATA 


A 


Ha-conct ruction  of  Finite  Automata 


So  far,  we  have  described  a  method  for  constructing  the  simplest  Finite  Automaton  from  given 
examples.  Suppose  we  have  solved  one  problem,  and  are. given  another  problem  whose  examples 
are  very  close  to  the  previous  one.  To  solve  this  new  problem  starting  from  the  beginning  is  rather 
tedious  because  we  already  have  some  information  about  the  solution.  In  this  chapter,  we  describe 
how  to  re  construct  a  finite  automaton  if  the  right-list  and/or  wrong-list  is  slightly  altered. 

After  the  sample  lists  are  altered,  if  the  machine  still  accepts  all  strings  in  the  right-list  but  no  strings 
in  the  wrong-list,  the  previous  solution  is  the  new  solution.  If  the  machine  does  not  accept  some 
strings  in  the  right-list,  and/or  does  accept  some  strings  i  the  wrong-list,  we  refer  to  such  strings  as 
inconsistent  strings.  Whenever  we  find  an  inconsistent  string  in  the  right-list,  we  call  a  procedure, 
add-trivially,  which  revises  the  machine,  so  that  it  accepts  all  strings  in  the  right-list.  On  the  other 
hand,  whenever  we  find  an  inconsistent  string  in  the  wrong-list,  we  call  a  procedure, 
cut-wrong-arrow,  which  revises  the  machine,  so  that  it  accepts  no  string  in  the  wrong-list.  Although 
after  calling  add-trivially  there  is  no  inconsistent  string  in  the  right-list,  there  may  now  be  another 
inconsistent  string(s)  in  the  wrong-list.  In  this  case,  we  call  cut-wrong-arrow.  Similarly,  although  after 
calling  cut- wrong- arrow  there  is  no  inconsistent  string  in  the  wrong-list,  there  may  now  be  another 
inconsistent  string(s)  in  the  right-list.  In  this  case,  we  call  add-trivially.  Thus,  we  call  add-trivially  and 
cut-wrong-arrow  again  and  again. 

We  first  define  add-trivially  and  cut-wrong-arrow,  and  then  we  show  that  our  process  always 
terminates,  producing  the  desired  machine  that  accepts  all  strings  in  the  right-list  but  no  string  in  the 
wrong-list,  although  the  machine  is  not  the  simplest. 


4.1  Add<trivially 

The  purpose  of  this  add-trivially  routine  is  to  accept  an  inconsistent  string  in  the  right-list,  no  matter 
how  many  strings  in  the  wrong-list  the  machine  comes  to  accept.  We  first  define  trivial  state  and 
trivial  path ,  then  finally  we  define  add-trivially. 

Definition:  In  each  machine,  we  consider  that  there  is  a  special  arrow  named  starting  arrow,  which 
always  points  to  the  initial  state  qv 

Definition:  If  more  than  one  arrow  (including  the  starting  arrow  and  the  one  from  q  itself)  point  to 
a  state  q,  then  q  is  called  a  non-trivial  state.  If  only  one  arrow  points  to  q,  then  q  is  called  a  trivial 
state. 

Definition:  A  sequence  of  states  q.(1),qj(2), . qj(k)  is  called  a  path  of  a  string  ava2 . aM,  where 

each  <r  is  in  {1,0},  iff  for  all  j  such  that  1  <  j  <  k-1,  if  a;  «  0  then  A^  =  i(j  + 1)  else  »  i(j  + 1). 

Definition:  A  sequence  of  states  qj(1),q|(2) . qj(k)  is  called  a  trivial  path,  iff  this  sequence  is  a  path, 

and  for  all  j  such  that  2  <  j  <  k,  q.(j)  is  a  trivial  state,  and  for  ail  j  such  that  2  <  j  <  k-1,  qj(j}  is  a 
non-final  state,  and  qj(k)  is  a  final  state.  This  path  accepts  only  one  string. 

That  the  machine  M  does  not  accept  a  string  aya2 . ak  means  either  of  the  followings: 

1.  There  is  a  path  of  aya2 . ,ak,  but  the  last  state  is  a  non-final  state. 

2.  There  exists  an  integer  j  such  that  there  is  a  path  of  a1 . aj  t  ,  but  the  last  state  of  this 

path  does  not  have  an  ttarrow. 

where  each  aj  is  in  {1 ,0}. 
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For  each  inconsistent  string  in  the  right-list,  add-trivially  works  as  follows:  in  case  1.  let  the  last 
non-final  state  be  the  final  state;  in  case  2.  create  a  trivial  path  from  the  last  state  so  that  the  machine 
accepts  the  whole  string. 

It  is  easy  to  show  that  after  calling  add-trivially  the  machine  accepts  all  strings  in  the  right-list. 
However,  it  also  may  come  to  accept  some  strings  in  the  wrong-list,  as  we  mentioned  before.  In  this 
case,  we  call  cut-wrong- arrow  defined  below. 


4.2  Cut-wrong-arrow 

If  there  are  some  inconsistent  strings  in  the  wrong-list  (i.e.  the  machine  does  accept  the  strings),  we 
call  cut-wrong-arrow  so  that  the  machine  comes  to  accept  none  of  these  strings,  no  matter  how  many 
strings  in  the  right-list  the  machine  comes  to  reject. 

For  each  inconsistent  string  in  the  wrong-list,  cut-wrong-arrow  works  as  follows:  Let 

q,irq,(2), . qj(k)  be  a  path  of  the  string  w  that  should  not  be  accepted.  To  reject  w,  one  of  the  arrows 

of  the  path  must  be  cut.  Let  qj(n)  be  one  of  the  non-trivial  states  in  the  path.7  Cut  the  arrow  from 
qi(n-i) to  qi(n>-  lf  qi  (in't'al  state)  is  the  only  non-trivial  state,  then  let  the  machine  M  be  ((0  0  0)), 
wnicn  does  not  accept  anything. 

It  is  easy  to  show  that  after  calling  cut-wrong-arrow  all  strings  in  the  wrong-list  are  rejected, 
although  the  machine  may  come  to  reject  some  strings  in  the  right-list.  In  this  case,  we  call  add- 
trivially. 


4.3  Termination 

In  this  section,  we  show  that  the  algorithm  above  always  terminates. 

Theorem:  The  algorithm  above  always  terminates. 

Proof:  Consider  the  following  partial  ordering: 

non-triviality  of  state:  the  number  of  arrows  which  point  to  the  state. 

non-triviality  of  machine:  total  of  non-triviality  of  all  non-trivial  states. 

We  denote  this  by  nt(M),  where  M  is  a  machine.  Note  that  flt(M)  =  0,  iff  M  is  a  trivial  machine. 

Let  M’  be  the  result  of  adding-trivially  to  M,  then  nt(M')  *  ftt(M),  because  add-trivially  adds  only  a 
trivial  path.  Next,  let  M’  be  a  result  of  cut-wrong-arrow  over  M,  then  rtf(M’)  <  nt(M),  because  we 
always  cut  the  arrow  that  points  to  a  non-trivial  state  q,  and  non-triviality  of  the  state  q  decreases,  and 
therefore  non-triviality  of  machine  also  decreases.  Thus,  we  cannot  have  an  infinite  loop,  add- 

trivially  ,  cut-wrong-arrow,  add-trivially,  cut-wrong- arrow,  add-trivially .  because  nt(M)  always 

decreases  but  nt(M)  >  0.  <end  of  proof> 


7 Such  a  non  trivial  state  always  exists  it  the  original  machine  has  (seen  simplified,  and  throughout  this  paper,  we  deal  only 
with  the  re  construction  ot  a  simplified  machine. 
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Finally,  we  describe  an  actual  system.  RR.  that  learns  to  construct  finite  automata.  RR  is  running  in 
MACLISP  either  on  CMU-2CC  or  CMU-10A. 

RR  has  a  machine  (finite  automaton)  and  each  time  RR  is  given  a  string  in  (1  +  0)'  as  its  input,  RR 
runs  the  machine  with  the  string  given.  If  the  machine  accepts  the  string,  RR  answers  ACCEPT, 
otherwise  it  answers  REJECT.  At  the  very  beginning,  RR  has  a  null  machine,  which  accepts  nothing, 
and  therefore  RR  does  not  accept  any  string  at  all.  Now,  consider  some  regular  set  1R  that  we  want  to 
teach  to  RR.  When  we  input  a  string  s  to  RR,  it  should  accept  s  if  and  only  if  s  is  in  !R.  If  s  in  not  in  IR, 
RR  should  reject  it.  Whenever  RR  answers  incorrectly,  we  scold  it.  When  RR  answers  correctly  and 
we  think  this  example  is  important8  ,  we  encourage  it.  When  RR  is  scolded  or  encouraged,  it 
memorizes  the  fact  that  the  string  must  be  accepted  or  rejected,  that  is,  if  it  is  the  case  that  the  string 
must  be  accepted.  RR  puts  it  into  right-list,  which  is  a  set  of  strings  that  must  be  accepted,  and 
similarly,  if  the  string  must  be  rejected,  RR  puts  it  into  wrong-list.  After  memorizing,  RR  re-constructs9 
the  machine  in  the  way  described  in  chapter  4,  so  that  it  accepts  all  strings  in  the  right-list  and  none 
in  the  wrong-list.  After  each  re-construction,  RR  simplifies  the  machine  in  the  way  described  in 
chapter  3. 

Figure  5-1  shows  a  flow  chart  of  the  RR  system. 


*We  do  not  need  to  encourage  it  every  time  it  answers  correctly. 

Q 

Onty  when  it  has  been  scolded. 


> 

i 
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Figure  5-1 :  Top  Level  of  RR  System 

_ \  / _ 

Take  input  S 


accented  /  .  .  rejected 

i  ■  (machine  j  ■  1  - 


Print  "ACCEPT' 


Print  "REJECT' 


^encou raged 


scolded , 


comment 

\?/ 


nothing 


encourage  c 
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5.1  How  to  execute  the  RR  system 

In  this  section,  we  describe  how  to  execute  the  RR  system,  and  in  the  following  section,  we  show 
several  sample  runs. 

5.1.1  Getting  Started 

RR  runs  in  MACLISP  either  on  CMUC  or  CMUA.  in  MACLISP,  type 
(slurp  <tommy>  rr)  (CMUC) 
or 

(slurp  c410mt80  rr)  (CMUA). 

And  call  function: 

(main)  (both  CMUA  and  CMUC). 

Then  you  get  prompt  ">»"  and  are  in  the  RR  system. 

5.1.2  How  to  teach 

•  Giving  example:  The  format  for  giving  an  example  to  RR  is  the  following: 

(  O-or-1  <space>  O-or-1  <spaca> . <spa  ca>  O-or-1 ) 

Typical  input  is: 

>>>(10101010) 

RR  then  outputs  the  answer,  either  ACCEPTED  or  REJECTED. 

•  Scolding:  To  scold  for  a  wrong  answer,  input  n  right  after  the  wrong  answer. 

>»n 

•  Encouraging:  To  encourage  RR,  input  y  right  after  the  answer. 

>»y 

•  Anyway- accept:  If  the  example  string  starts  with  +,  this  means  :  if  this  string  is  accepted 
then  encourage ;  otherwise  scold.  Typical  input  is: 

>>>(+101001) 

•  Anyway- reject:  If  the  example  string  starts  with  -,  this  means:  if  this  string  is  rejected 
then  encourage;  otherwise  scold.  A  typical  input  is: 

>>>(-  0  0  0  1  0  0  1  0) 

To  give  the  null  string,  use  ()  or  (+)  or  (-). 

5.1 .3  Other  Commands 

•  r:  show  present  right-list. 

•  show  present  wrong-list. 
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•  m:  show  present  machine. 

•  1 :  show  last  input. 

•  o:  show  order  of  memorized  strings. 

•  t:  show  runtime  of  each  step  and  total  runtime. 

•  ?:  show  every  thing  above. 

•  new:  initialize. 

•  tG.  quit. 

5.2  Sample  Runs 


5.2.1  Sample  Run  1: 

As  the  simplest  example,  let  us  teach  the  regular  set  1*  to  RR.  The  desired  machine  is: 

((0  1  1)). 

The  underlined  strings  are  user’s  inputs,  and  the  Italic  strings  are  comments. 

[PHOTO:  Recording  Initiated  Thu  4-Mar-82  2:38PM] 

TOPS-20  Command  processor  4(723) -7 


91111 

[Keeping] 

MacLIsp  for  TOMMY 
• 

laliO-Slfflmifc  rr) 

(<TOMMY>  RR  FASL) 

(lain) 

»>  him 

>»  u 

REJECTED 

»>  n 

MODIFYING  • 

>»  111 

REJECTED 

>»  n 

MODIFYING  •• 

»>  U-Ui 

ACCEPTED 

»>  UU 

REJECTED 

»>  U-fl  1  1.1) 

REJECTED 

»>  tillllllltll-llll 
ACCEPTED 
»>  1 
RI6HT-LIST 


Initialization. 

Input  null  string  as  an  example. 

The  null  string  was  rejected. 

Since  null  should  be  accepted,  scold  it. 

It  is  trying  to  modify. 

Next,  input  (11 
(1)  was  rejected. 

Since  (l)  should  be  accepted,  scold  it. 

It  is  modifying  itself. 

Next  try  (1  1  1). 

This  was  accepted,  all  right,  no  scolding. 

Next  try  (0).  which  should  not  be  accepted. 

This  was  rejected,  all  right,  no  scolding. 

Next  try  (10  11  1).  which  should  not  be  accepted. 
Rejected,  all  right,  no  scolding. 

Next  try  this. 

Accepted,  all  right,  it  should  be  accepted. 

Maybe  we've  got  1  let  us  look  inside  the  machine. 
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(MIL  (1)) 

WRC.'iG-LIST 

NIL 

PRESENT-MACHINE 
((0  1  D) 

LAST- INPUT 

(+11111111111111) 

ORDER 

((+)  (  +  l)) 

TIME 

(0.019  0.048) 

TOTALTIME 

(0.067) 


Rignt-list  contents  null  string  and  (1). 

Wrong-list  contents  nothing. 

Present  machine  is,  yes,  l\ 

We  taught  it  in  this  order.  +  means  "in  right-list". 
Time  spent  to  teach  ( + )  and  ( +  1). 

Total  time  in  seconds  to  learn  T*. 


5.2.2  Sample  Run  2: 


Let  us  try  to  teach  a  harder  automaton,  problem  4.  This  regular  set  is: 

The  difference  between  the  nuaber  of  0's  and  the  nuaber  of  l's  Is 
divisible  by  3. 

For  instance,  the  string  (10  111)  should  be  accepted  because  4-1  >3  is  divisible  by  3.  The 
desired  machine  is  as  follows: 

((3  2  1)(1  3  0)(2  1  0)). 


»>  on 

»>  n 

REJECTED 
»>  fl 

MODIFYING  • 

»>  ■ 

((0  0  1)) 

»>  (1  1  1) 

REJECTED 

»>  fl 

MODIFYING  •• 

»>  ■ 

((o  l  l)) 

»>  ill 
ACCEPTED 
»>  n 

MODIFYING  •• 

»>  | 

((0  2  1)  (0  3  0)  (0  1  0)) 

»>  u  1 1  mi 

ACCEPTED 

»>  m 

REJECTED 

»>  X 

REJECTED 


First,  let  us  try  null,  which  should  be  accepted. 

No,  null  should  be  accepted. 

Show  the  present  machine. 

This  machine  accepts  nothing  but  a  null  string. 

No,  this  should  be  accepted. 

This  machine  is  1  *. 

No,  this  should  be  rejected. 

This  machine  is  (1  1  1)’. 

All  right,  it  should  be  accepted. 

Yes,  it  should  be  rejected.  Particularly,  encourage  it. 
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»>  (0  0  0) 

REJECTED 

>>>  n  No.  this  should  be  accepted. 

MODIFYING  •• 


»>  (0  0  0  flJLft) 

ACCEPTED  All  right. 

»>  (Q  Q\ 

REJECTED  All  right. 

»>  m 

((4  2  1)  (0  3  0)  (0  1  0)  (5  0  0)  (1  0  0)) 

>»  a  cn 

REJECTED 

»>  fl 

MODIFYING  ••• 

>»  0 

((4  2  1)  (1  3  0)  (0  1  0)  (2  0  0)) 

»>  (0  n 

REJECTED 

»>  n 

MODIFYING  ••• 

»>  9 

((3  2  1)  (1  3  0)  (2  1  0)) 

»>  11  1  0  0  0  0 


(111  +  000) 


No,  this  should  be  accepted. 


1  0  0  1  0  Q  1  0  0  0 


REJECTED 


No,  this  should  be  accepted. 


Now,  we  get  the  desired  machine. 

o  <n 


Ok. 


»>  u  no  .0  01001001000001  » 

ACCEPTED  Ok. 

>»  2 

RIGHT-LIST 

(NIL  (1  1  1)  (0  0  0)  (1  0)  (0  1)) 

WRONG-LIST 

(d)  (0)) 

PRESENT-MACHINE 

((3  2  1)  (1  3  0)  (2  1  0)) 

LAST- INPUT 

(+11000010010010000011) 

ORDER 

((+)  (+  1  1  1)  (-  1)  (-  0)  (+  0  0  0)  (+  1  0)  (+  0  1)) 

TIME 

(0.014  0.087  0.117  -0.01  0.364  0.864  1.413) 

TOTALTIME 

(2.640) 


5.2.3  Sample  Run  3: 

The  total  run  time  to  team  the  desired  machine  depends  very  much  on  the  order  of  input  examples. 
We  now  try  the  previous  sample  again  but  with  a  different  order. 

»>  jbh 

»>  a 
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REJECTED 

»>  n 

MODIFYING  • 

»>  § 

((0  0  1)) 

>»  L 1  oi 

REJECTED 

>»  n 

MODIFYING  •• 

>»  fO  11 
REJECTED 

»>  n 

MODIFYING  •• 

»>  a 

((3  2  1)  (1  0  0)  (0  1  0))  Present  machine  is  (10  +  01 

»>  g,  i .» 

REJECTED 

»>  fl 

MODIFYING  ••• 

»>  ■ 

((3  2  1)  (1  3  0)  (0  1  0))  (10  +  01  +  inf 

»>  U- 1) 

REJECTED 

»>  (1.1  1  1  1,1) 

ACCEPTED 

»>  XJULftl 

REJECTED 

»>  n 

MODIFYING  ••• 

»>  ■ 

((3  2  1)  (1  3  0)  (2  1  0))  This  is  the  desired  machine. 

»>  (Q- 1  0- 1  a  1-Q  1-0  0  Q  Q  0  0  1  1  1  1  1  1  1  1  n 

ACCEPTED 

»>  (X  0  1.  Q  1  Q  1  0  0  Q  0  Q  .Q  fl  1  1  1  ;  0.  1  1  11  1) 

REJECTED 

»>  2 

RIGHT-LIST 

(NIL  (1  0)  (0  1)  (1  1  1)  (0  0  0)) 

WRONG- LIST 
NIL 

PRESENT-MACHINE 

((3  2  1)  (1  3  0)  (2  1  0)) 

LAST- INPUT 

(-101010100000001111011111) 

ORDER 

((+)  (+  1  0)  (+  0  1)  (♦  1  1  1)  (♦  0  0  0)) 

TIME 

(0.014  0.006  0.068  0.21  0.876) 

TOTALTIME 

(1.262)  The  total  time  is  much  shorter. 
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5.2.4  Sample  Run  4: 

V.'e  nevt  try  problem  3.  which  is  very  hard.  This  regular  set  is: 

Any  strings  without  odd  number  of  consecutive  0’s  AFTER  odd  number 
of  consecutive  l’s. 

»>  new 

»>  Ollt  1  H±  0-H-  1.0H  +  0  11(4  1  1)X±_0  OH-  1  0  1  flK±  i_Q_QA 

(+  1  1  0)(  +  1  1  1)(  +  0,0  0H-_.l  0  1  OH-  1  1  X  OH-  1  0  1  1H-  1  0  0_Q_J) 
(-11101  0U-  1  0  0  1  0  0  OH-  1  1  1  1  1  Q  0  01 

Ll  Q.  1-1_1  0  0  U0.m-  l  0-1  HO  -LLJU 

(+1  1  000001110000  1)(+  ill  10  no.oouionmi 

MODIFYING  • 

»> 

MODIFYING  •• 

»> 

MODIFYING  •• 

>» 

MODIFYING  •• 

»> 

ACCEPTED 

»> 

MODIFYING  •• 

>» 

ACCEPTED 

>» 

REJECTED 

>» 

REJECTED 

>» 

MODIFYING  •• 

»> 

ACCEPTED 

»> 

ACCEPTED 

>» 

ACCEPTED 

»> 

REJECTED 

>» 

REJECTED 

>» 

REJECTED 

>» 

MODIFYING  •••••••• 

»> 

REJECTEO 

>» 

MODIFYING  . . 

»> 
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^EJECTED 

»> 

REJECTED 

»> 

REJECTED 

>» 

MODIFYING  *• 

»> 

MODIFYING  ••••• 

»>  1 

RIGHT-LIST 

(NIL(1)(0)(0  1 )( 1  1)(0  0) ( 1  0  0)(1  1  0)(1  1  1) ( 0  0  0)(1  1000001 
110000  1)(1  11101100010011100)) 

WRONG-LIST 

((1  0) ( 1  0  1)(0  1  0)(1  0  1  0) ( 1  1  1  0)(1  0  1  1)(1  000  1)(1  1  1  0  1  0)( 
100100  0) ( 1  11110  0  0)(0  11100110  1)(1  1011100110)) 
PRESENT-MACHINE 

((1  2  1)(3  1  1)  (4  0  0)(3  4  1))  This  is  the  desired  machine. 

LAST- INPUT 

NIL 

ORDER 

((+)(+  1)(+  0)(-  1  0)(+  0  1)(+  1  1)(+  0  0)(-  1  0  1)(-  0  1  0)(+  1  0  0)( 

+11  0)(+  1  1  1)(+  0  0  0)(-  101  0)(-  111  0)(-  101  1)(-  1  0  0  0  1)( 
-11101  0) (-  100100  0) (-  1111100  0)(-  011100110  1)(- 
1101110011  0)(+  11000001110000  1)(+  111101100 
010011100)) 

TIME 

(0.014  0.042  0.08  0.078  8.0E-3  0.116  8.0E-3  0.013  0.011  0.357  0.011  0.012 
3.0E-3  9.0E3  0.01  0.011  2.066  0.012  3.686  0.014  0.018  0.02  0.283  3.736) 

TOTALTIME 

(10.282) 


5.2.5  Sample  Run  5: 

We  now  try  the  previous  run  again  with  a  more  effective  ordering. 

»>  new 

»>  f-  1  OH-  10  0  1  OH-  10  0  OH-  10  0  11  0H  +  )ffl(+  01m(+  0  llw 
f+  0  1  1  0)mf  +  1  0  0)m(+  1  Utnf+10  0  l)m(+  1  0_0_Q_01 
REJECTED 
»> 

REJECTEO 

»> 

REJECTED 

»> 

REJECTED 

»> 

MODIFYING  • 
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((0  0  1)) 

»> 

MODIFYING  •• 

»> 

(d  0  1)) 

»> 

MODIFYING  • 

»> 

((1  2  1)  (0  0  1)) 

»> 

MODIFYING  •• 

»> 

((1  2  1)  (0  1  1)) 

»> 

MODIFYING  • 

»> 

((1  2  1)  (3  1  1)  (4  0  0)  (0  0  1)) 

»> 

ACCEPTED 

»> 

((1  2  1)  (3  1  1)  (4  0  0)  (0  0  1)) 

»> 

MODIFYING  •• 

»> 

((1  2  1)  (3  1  1)  (4  0  0)  (0  4  l)) 

»> 

MODIFYING  ••• 

>»  ■ 

((1  2  1)  (3  1  1)  (4  0  0)  (3  4  1)) 

»>  1 

RIGHT-LIST 

(NIL  (0)  (0  1)  (0  1  1  0)  (1  0  0)  (1  1)  (1  0  0  1)  (1  0  0  0  0)) 

WRONG-LIST 

((1  0)  (1  0  0  1  0)  (1  0  0  0)  (1  0  0  2  1  0)) 

PRESENT-MACHINE 

((1  2  1)  (3  1  1)  (4  0  0)  (3  4  1))  This  is  the  desired  machine. 

LAST- INPUT 

NIL 

ORDER 

((-  1  0)(-  1001  0)(-  100  0)(-  10011  0)(+)(+  0)(+  0  1)(+  0110) 
(+10  0)(+  1  1)(+  100  1)(+  10000)) 

TIME 

(  9.0E-3  9.0E-3  0.017  9.0E-3  0.016  0.047  0.029  0.091  0.042  0.013  0.38  0.342) 


TOTALTIME 

(0.89) 


This  is  much  taster  than  the  previous  run. 
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V.'e  saw  in  the  previous  section  that  the  run-time  of  sample  run  3  is  much  shorter  than  the  run-time 
of  sample  run  2.  and  also  sample  run  5  is  much  faster  than  sample  run  4.  Thus.  RR  is  very  sensitive  to 
what  is  given  as  examples,  and  how  these  are  ordered.  In  this  section,  we  are  interested  in  how  to 
teach  RR  effectively. 

First,  we  consider  the  worst  case  and  the  best  case  of  re-construction.  In  the  worst  case,  RR  calls 
add-trivially  and  cut-wrong-arrow  again  and  again,  and  eventually  its  machine  becomes  the  trivial 
machine.10  We  know  that  a  trivial  machine  can  be  constructed  easily  without  such  a  special 
technique  as  re  construction. 

On  the  other  hand,  the  best  case  is  that  RR  calls  add-trivially  once  but  no  further  cut-wrong-arrow. 
Thus,  in  order  to  "teach"  the  RR  system  effectively,  we  have  to  choose  the  examples  nicely  so  that 
RR  can  re-construct  its  machine  only  by  add-trivially.  For  instance,  the  example  inputs  of  sample  run 
3  and  sample  run  5  are  so  chosen,  and  their  run-time  is  in  fact  very  short.  Also,  to  avoid  calling 
cut-wrong-arrow,  we  had  better  give  the  negative  examples  earlier. 


10 


A  trivial  machine  is  a  machine  that  accepts  exactly  all  strings  in  the  right-list  and  nothing  else  See  chapter  3. 
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3.  Concluding  Remark 

Our  new  approach  to  construction  of  finite  automata  from  given  exampies  has  been  shown  to  work 
very  nicely,  despite  the  fact  that  its  algorithm  is  quite  simple.  In  chapter  2.  we  saw  that  construction  of 
finite  automata  with  n  states  can  be  nicely  done  using  hill-climbing  if  n  is  a  reasonable  number.  In 
chapter  3.  we  saw  that  we  could  often  simplify  the  resulting  machine  of  chapter  2  also  using  hill¬ 
climbing.  although  some  problems  could  not  be  solved.  In  chapter  4,  we  discussed  how  to  utilize  past 
work,  if  a  given  problem  is  very  close  to  the  past  problem.  The  RR  system,  which  uses  these 
techniques,  was  introduced  and  described  in  chapter  5.  Finally,  we  enumerate  several  extensions  of 
this  work. 

•  Our  hill-climbing  algorithm  sometimes  climbs  a  local  hill,  and  therefore  fails  to  find  a 
correct  solution.  There  are  several  ways  to  avoid  climbing  a  local  hill,  and  one  of  them  is 
adaptive  search  [Cavicchio  70],  [Holland  75].  Adaptive  search  can  be  considered  as  a 
powerful  version  of  hill-climbing.  There  are  not  only  one  "current  generation",  but 
usually  a  population  of  20-30 .  The  best  five  or  so  are  chosen  as  winners  (the  others  are 
discarded)  and  15-21.  slightly-altered  copies  of  them  are  made  as  the  new  population. 
Altering  way  is  not  only  mutation,  but  also  cross-over  (mix  two  and  produce  one),  inverse 
(inverse  a  certain  part  of  one)  ,”  and  so  on.  This  approach  becomes  really  powerful  if 
parallel  computation  is  available. 

•  Our  finite  automata  have  been  deterministic,  that  is,  arrows  either  exist  or  do  not  exist. 

The  operator  create-arrow  or  delete-arrow  often  makes  too  much  difference  to  climb  hill 
smoothly.  The  idea  is  to  let  our  finite  automata  be  probabilistic,  that  is,  an  arrow  exists 
partially  with  a  real  number  between  0.0  and  1.0,  which  indicates  a  probability  of 
existence  of  the  arrow.  (See  [Rabin  63].)  In  this  case,  we  increase  or  decrease  the  real 
numbers,  rather  than  create  or  delete  an  arrow.  This  method  might  help  to  climb  hills 
smoothly. 

•  Our  mutation  function  might  be  modified  so  that  the  mutation  does  not  take  place 
completely  randomly,  but  somewhat  "cleverly".  For  instance,  if  the  machine  accepts  a 
string  in  the  wrong-list,  then  delete-arrow  or  decrease- prob- of -arrow  should  take  place 
more  often  on  this  wrong  path  than  on  others.  Our  idea  becomes  more  concrete  if  we 
deal  with  the  probabilistic  automata  described  in  the  previous  paragraph.  If  the  machine 
somehow  accepts  a  string  in  the  wrong-list,  then  we  should  decrease  all  probabilities  of 
the  arrows  on  this  path.  If  the  machine  accepts  a  string  in  the  right-list,  we  increase  the 
probabilities  on  this  path,  etc. 

•  Our  problem  domain  in  this  paper  has  been  regular  sets.  It  might  be  possible  to  extend  it 
to  context-free  sets  by  constructing  Push-Down  Automata  (finite  automata  with  stack,  see 
[Hopcroft  79]).  Since  construction  of  Push-Down  Automata  must  be  much  harder  than 
finite  automata,  we  would  definitely  need  techniques  just  listed. 

•  A  finite  automaton  can  be  viewed  as  a  program  that  takes  a  string  as  its  argument  and 
outputs  TRUE  or  FALSE.  Therefore  we  might  be  able  somehow  to  apply  our  method  to 
automatic  programming  from  specification  by  examples. 


1  'The  cross-over  operator  acts  on  a  pair  of  strings  by  breaking  each  string  at  some  point  and  rejoining  the  subsegments 
from  different  strings.  The  inversion  operator  makes  two  breaks,  inverts  the  inner  segment  and  then  retom  the  string. 
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